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Measurement of the amplitudes of both the E and B components of the CMB polarization will 
open new windows onto the early Universe. Using a Fisher-matrix formalism, we calculate the 
required sensitivities and observing times for an experiment to measure the amplitudes of both 
E and B components as a function of sky coverage, taking full account of the fact that the two 
components cannot be perfectly separated in an incomplete sky map. We also present a simple 
approximation scheme that accounts for mixing of E and B components in computing predicted 
errors in the E-component power spectrum amplitude. In an experiment with small sky coverage, 
mixing of the two components increases the difficulty of detecting the subdominant B component by 
a factor of two or more in observing time; however, for larger survey sizes the effect of mixing is less 
. pronounced. As a result, the optimal experimental setup for detecting the B component must cover 

an area of sky significantly larger than is found when mixing is neglected. Surprisingly, mixing of 
E and B components can enhance the detectability of the E component by increasing the efi'ective 
number of independent modes that probe this component. The formalism presented in this paper 
can be used to explore ways in which survey geometry and nonuniform noise due to uneven sky 
coverage will affect detectability of the two components. 
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PACS numbers; 98.70.Vc,98.80.Es,98.80.Cq,98.80.-k 



Ph; I. INTRODUCTION 

O , 

, The cosmic microwave background radiation (CMB) contains a wealth of information about the high-redshift 
^ . Universe. Efforts to map the temperature anisotropy and estimate its power spectrum have already revolutionized our 
understanding of cosmology, strongly suggesting that the Universe is spatially flat and constraining other cosmological 
parameters (e.g., |]l], |[ ^, ^, |[ ||, 0). Future anisotropy measurements, especially the MAP [8] and Planck [9] satellite 
missions, will allow the cosmological parameters to be determined with unprecedented precision (e.g., [Tl|). 
Polarization will be the next frontier in the study of the CMB; considerable effort is already underway [e.g.. 



d • |T^, |TJ])to map the CMB polarization structure. That the CMB should be polarized is an extremely robust 

prediction |l^ of the gravitational instability paradigm of standard cosmological theory, so detection of CMB 
polarization will provide a reassuring confirmation of the standard model. Moreover, a large amount of additional 
information is expected to be lurking in CMB polarization maps. Measurement of the CMB polarization power spectra 
will help in breaking certain degeneracies between cosmological parameters [|l7, 18, 1^, 0. But perhaps most exciting 



of all is the possibility that CMB polarization measurements will contain the signature of primordial gravity waves 
(tensor perturbations) produced during an inflationary epoch |21, 2^, If these hopes are realized, we will have far 



more direct evidence than we currently do that inflation actually occurred, and we will have probed the Universe at 
far earlier times than any current observations. 

Detecting the gravity-wave signal in a CMB polarization data set will be a daunting task: the tensor contribution to 
the CMB polarization is expected to be considerably smaller than the contribution due to scalar density perturbations. 
Fortunately, the tensor contribution has a geometrically different signature from the scalar contribution, allowing 
the two to be separated. To be specific, any polarization map can be split into two components ^: the "E 
component," which transforms as a scalar, and the "B component," which transforms as a pseudoscalar.^ Scalar 
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^ The division of a polarization field into E and B components is closely analogous to the division of a vector field into curl-free and 
divergence-free parts. This is the reason for naming the components after the electric and magnetic fields. (Of course it is hardly 
necessary to add that these names have nothing to do with the actual electric and magnetic fields of the radiation.) Elsewhere in the 
literature, the two components are called G and C, for "gradient" and "curl" respectively. If the world needed yet another name for 
these components (which it certainly doesn't), they could be called + and X; as we will see below, these give the orientation of the 
polarization with respect to the wavevector in Fourier space. 



FIG. 1: Gaussian "hot spots." The left panel is purely E-component, and the right panel is purely B-component. 



density perturbations produce only the E component, leaving the B component as a clean probe of subdominant 
sources of polarization such as tensor perturbations.^ 

Given a full-sky polarization map, it is possible to separate the E and B components perfectly; with incomplete sky 
coverage, however, there is inevitably some cross-contamination between the components. This naturally makes detect- 
ing the B-component more difficult, as it is in danger of being swamped by the (typically much larger) E-component. 
In this paper, we will examine the experimental requirements to detect both E and B-component polarization signals 
in a degree-scale experiment, accounting for this cross-contamination. 

One approach to separating E from B is to observe in a circular ring [p4| , ^5| , ^ . The separation of components is 
particularly clean in this case, but a strategy involving a two-dimensional map is likely to be much better for measuring 
power spectrum amplitudes P7| , as many more independent modes at a given scale can be found in the data. Attention 
has therefore been paid to finding normal modes that minimize the complications due to E-B cross-contamination in 
a two-dimensional map |2^, In the present paper, we adopt a more straightforward approach: we consider the 
likelihood function of a polarization map in pixel space and compute the Fisher matrix for the normalizations of both 
E and B power spectra. Since this "brute-force" approach is based on the likelihood function of the full data, it must 
be at least as good as (i.e., give as small error bars as) any method based on an expansion in normal modes. 

The remainder of this paper is organized as follows. In Section |l^, we review some properties of polarization maps 
and illustrate the difficulties in splitting a partial-sky map into E and B components. In Section III, we present the 
Fisher-matrix formalism we will use to determine the detectability of the two components in a given experiment and 
also present two simple approximation schemes for determining the detectability. Section ^ presents our results, and 
Section ^ contains a brief discussion. 



II. THE E-B DECOMPOSITION 



In this section we describe the nature of the E-B decomposition of a polarization field. This description makes no 
iretense of completeness; much more information on this subject can be found in the literature. See in particular 
29], ^ and references therein. 



In addition to tensors, vector perturbations also produce B-component polarization; however, because vector perturbations decay over 
time, they are not expected to contribute significantly to the observed polarization, at least in the inflation-inspired models we consider in 
this paper. In some models (topological-dofect models in particular), vector perturbations may be sourced at times close to recombination 
and may therefore be observable, but we do not consider such models in this paper. 



FIG. 2: The left panel is simply one quadrant of the E-component hot spot shown in Figure The center and right panels are 
a pure E and a pure B component respectively, which sum to equal the left panel. 



A. The flat-sky approximation 



Polarization is a spin-2 quantity (i.e., it is invariant under 180° rotations). The linear polarization expected to be 
found in the CMB is described by the two Stokes parameters Q and U, which are related to the magnitude P and 
direction (p of the polarization as follows: 

Q = P cos 20, (1) 
U = Psm2(j). (2) 

There are several ways to describe the division of a polarization map into E and B components. One simple way is 
to examine a small patch of the sky, which can be well approximated as flat. In this approximation, an E-component 
polarization field is one whose Stokes parameters satisfy the following equation: 

while a B-component field satisfies this equation: 

(These equations are simply the spin-2 analogues of the equations V x v = and V • v = for the scalar and 
pseudoscalar components of a vector field.) 

An arbitrary polarization field can always be written as a sum of an E and a B component. One easy way to see 
this is to work in Fourier space. If we consider a mode with spatial dependence e*'' '', then equation (H) implies that 
the E component has a polarization that is always parallel or perpendicular to k. In terms of the Stokes parameters, 
it can be written 

^/cos2^k\ ,:k.x^ (5) 



U J ^ \ sin 2(/>k 

where (/)k is the angle k makes with the x axis. Similarly, a i?-mode has a polarization direction that is always at a 
45° angle to k. It can be obtained by simply rotating the polarization of an E mode through 45° at each point: 

rM «l '%^J'^]e'^-. (6) 
[/ y ^ y cos 2(/)k ' 

To decompose an arbitrary map into E and B components, therefore, we simply take the Fourier transforms of Q 
and U and then, for each wavevector, project (Q, U) onto the axes defined by equations (^) and (||). Of course, to 
perform this decomposition, we need to know Q and U over all space. If we only know Q and U over a finite region, 
the Fourier transform and hence the E-B split will depend on the boundary conditions we assume. 

Figure |l] shows Gaussian "hot spots" for both the E and B components. Note that the B component, unlike the 
E component, has handedness; this is the reason that density perturbations, which lack handedness, cannot excite 
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this component. Of course, we could generate another pah of E, B maps by replacing (Q, U) by (— Q, —U), which is 
equivalent to rotating the polarizations in these plots through 90° at each point. 

To illustrate the ambiguity in the E-B decomposition of an incomplete data set, suppose that we have observed 
only one quadrant of the E-component hot spot, as shown in the left panel of Figure |. This is of course a pure E 
polarization field: it satisfies equation (j^) at every point. However, it is also the sum of the E and B polarization fields 
shown in the center and right panels. These fields were produced by Fourier transforming the region covered by the 
data with periodic boundary conditions and splitting each Fourier mode into E and B parts according to equations 
and (|^). There are infinitely many other ways the decomposition could have been done; for instance, the data 
could have been padded with zeroes on all sides before Fourier transforming. 

In this particular example, the "contamination" of a pure E component by B is not a small effect: the r.m.s. 
amplitude of the B component shown in Figure ^ is 0.87 times that of the E component. This is because the map 
being decomposed has a large amount of power power on scales of order the width of the map. In general, we can 
expect a large amount of cross-contamination between E and B on the largest scales probed by a given data set, with 
much less contamination on smaller scales. The reason for this is simple: the ambiguity in the E-B decomposition is 
a result of our ignorance of the boundary conditions to impose on the two components, so the natural length scale 
associated with E-B mixing is the width of the map. 



B. Beyond the flat-sky approximation 

Although the flat-sky formulae are in general simpler to work with, for many applications wc need to consider the 



exact, full-sky formulae. In this section we will very briefly summarize this formalism; see ]21| , |22| , 27, ^ for further 
details and useful identities. 

Since polarization is a spin-2 quantity, the natural basis functions to use in expressing the Stokes parameters {Q,U) 
on the sphere are the spin-2 spherical harmonics. Specifically, we can write 

oc / 

a=F2,im =F2ilm, (7) 

1=2 m=-l 

where ip2^im is a spin-2 spherical harmonic. Detailed information on the spin-2 spherical harmonics can be found in 
the sources cited above. For our purposes, all we need to know is that decomposing a polarization field into E and 
B components is quite simple in the spherical harmonic basis: the coefficients a^2,im in the expansion can simply be 
written 

a±2,lm — Ei,n ± iBim- (8) 

This, combined with the reality condition a*_2 i„i ~ f)™a2.i-m, allows one to determine Eim and -B;„i from the 
spherical harmonic coefficients. 

If we assume that the polarization is a realization of a statistically isotropic random process {i.e., that there is no 
preferred direction), then the ensemble averages (Eim) and {Bim) must both vanish, and the covariances must satisfy 

(ElmEi,„^,) — CfSll'Smm', (9) 
{BlrnBi,„^,) = CP Sii'dmm' ■ (10) 

If in addition the random process is parity-invariant (lacks handedness), then 

{EiraBt,^,) = 0. (11) 

Furthermore, if the random process is Gaussian, then the two power spectra and CP form a complete description 
of the random process.'^ These power spectra are therefore the only thing a Gaussian theoretical model needs to 
predict about CMB polarization. Fortunately, theoretical models are capable of computing predicted power spectra 
with great precision and speed |2^, |2^, for instance using the publicly available CMBFAST software [31]. 

Of course, actual observations always involve convolving the true polarization field with the telescope beam. As 
long as the beam is azimuthally symmetric and purely co-polar, this results in a simple replacement of Cf'^ with 



There is also the cross-correlation between the E component and the temperature anisotropy. We choose to focus exclusively on 
polarization data in the present paper, though, ignoring temperature data. 
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Cf'^bf, where bi is the Legendre transform of the beam pattern. For a Gaussian beam of width (Tbcam (FWHM 
= crbcam%/81n2), we have 

= exp{~l{l + l)al,^). (12) 

Throughout this paper, we wiU use Cf'^ to denote the beam-smoothed power spectra. 

In the next section, we wiU consider the analysis of data from a hypothetical CMB polarization experiment. The key 
ingredients in the analysis are the real-space correlations between measurements at different points, ((5(xi)(5(x2)), 
(?7(xi)[/(x2)), and (Q(xi)C/(x2)), which can be expressed as sums over the power spectra. Specifically, if Q and U 
are defined with respect to coordinate axes such that the x axis joins the two points, then 



(Q(xOQ(x2)) - 



21 + 1 
47r 

2/ + 1 



47r 



(Cf Fiz(xi • X2) - Cf ^^2KX1 • X2)) , 
(CfFi,(xi •X2)-Cfi^2/(X1 •X2)) , 



(C/(X1)C/(X2)) = J2 

I 

(Q(xi)C/(x2)) = 0, 

where the functions Fu and F21 can be expressed in terms of Legendre functions as 



{l+2)x 



P 



Fu{x) = 2 



F2i{x) = 4 



{l-l)l{l + l){l + 2) 

' {l + 'i)Pl,{x)-{l-l)xP^{x) 
^ (/-l)Z(/-t-l)(?-f 2)(l-a;2) 



(13) 

(14) 
(15) 

(16) 
(17) 



In practice, we often wish to know the correlations in some other coordinate system. Since we know how (Q, U) 
transforms under rotations, we can easily get these correlations by applying the appropriate rotation matrices to the 
correlations given above. (A pleasingly explicit recipe for doing this can be found in ||27[|.) 

III. FORMALISM 
A. Likelihoods and Fisher Matrices 

The Fisher information matrix provides a useful way to quantify the ability of a data set to estimate parameters. It 
has been applied to great effect in the study of CMB temperature anisotropy [e.g., [^) and also to CMB polarization 
studies p^, p3|. In this section, we show how the Fisher matrix can be used to calculate the significance with which 
the amplitudes of the E and B power spectra can be measured from a polarization map. 

Suppose we have made maps containing Q and U measurements at N pixels. The pixel locations are xi, . . . ,XAr. 
Our 27V data points can be written 



Ui 



U{xi) + eui 



(18) 
(19) 



Here i ranges over pixels in the map, and eg^ (efji) is the noise in the ith pixel of the Q (U) map. We will assume 
uncorrelated noise: 



{f-Aif-Bj) — C^Ai^ABSij- 

Here A and B range over {Q, U}, and is the noise variance in pixel i of map A. 



(20) 



We will arrange our data points into a data vector. 



/qi\ 



Ul 



(21) 



6 



The 2A'' X 2A'' data covariance matrix is defined to be M = (dd^). We can label elements of the data vector with a 
pair of indices iA, with i ranging from 1 to A'^ and A being cither Q or U. Then a typical element of the covariance 
matrix is MiAjB- The covariance matrix contains contributions from signal and noise 



MiAjB = M,%s + MI'ajb, (22) 



with 



MfgjQ = «;q(x,,x,) = (Q(x,)Q(x,)), (23) 

Mfuju = «^c/(xi,x,)=([/(xi)C/(x,)), (24) 

Mfgju = «^x(xi,x,) = (Q(xOf/(x,)), (25) 

= ^'hSijSAB. (26) 

As described in the previous section, the correlation functions wq,wu,wx can be expressed as sums over the power 
spectra. 

A theory predicts a pair of power spectra Cf and and hence a covariance matrix M. The likelihood of a theory 

is 

£(M) (X (dot M)-i/2 exp (^-^d'^M-^dj . (27) 

It is more convenient to work with the quantity 

L = Indet M + d^M'^d = -21n£ + const, (28) 

which can also be written in the following convenient form: 

L = Tr (In M + M-^d"^) . (29) 

(The logarithm of a matrix is as usual defined via the Taylor series, or equivalently by diagonalizing the matrix and 
taking the logarithms of its eigenvalues.) 

Now suppose that we are considering a class of theories that contains P unknown parameters ai...,ap, and 
suppose for simplicity that the covariance matrix is linear in these parameters, so that we can write 

p 

M = + ^(ap - 1)M^P\ (30) 
p=i 

Here M^^^ represents the (unknown) true covariance matrix. In other words, the true values of the parameters have 
been taken to be one. Our ability to measure parameters will be determined by how sharply peaked the likelihood is 
about its max;imum, so we perform a Taylor expansion in L to determine this. If we let dp stand for 8/ dap, then 

dpL = Tr {M-^dpM - M-\dpM)M-^dd^) = Tr (m-^M^^) - M'^M^P^M-^dd^^ . (31) 

Note that in the ensemble average, (dd^) = M^°\ so (dpL) — when all parameters are equal to one. 

The quantity that characterizes the sharpness of the likelihood peak is of course the second derivative, so we must 
plunge ahead and differentiate again: 

dgdpL = Tr (-M-^M^^'M-^M^p) + M-^M^^^M-^M^P^M-Md^ + M.-^M^P^M.-^M^i^M-^dd'^^ . (32) 

Let us take an ensemble average of this quantity and evaluate it at M = M(°) (which is both the true value and the 
ensemble-average maximum-likelihood location). Then 

2F,p = {d,dpL\^^^,,,) = Tr (mW"'m(«)m(o)"'mW) (33) 

The quantities Fgp are the elements of the P x P Fisher matrix F. They tell us the expected uncertainty with which 
a parameter ap can be determined from a likelihood analysis. Specifically, if all parameters except the pth are known 
a priori, then ap will be determined with an expected error of Xj^fF^. If on the other hand all parameters are 
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unknown, then the uncertainty is in ap is ^y {F~^)pp. Furthermore, these expected uncertainties are the smaUest that 
can be obtained from this data set by any unbiased data analysis method. 

We will be interested in determining whether the E- and B-component polarizations can be detected by a given 
experiment. Let us make an admittedly optimistic assumption: suppose that the shapes of the power spectra and 
CP are known but the amplitudes are not. Then we will have two parameters as and as, such that 

Cl'''' ^aE^BCf'"", (34) 

where Cf'^ represents the true power spectrum. Then the matrices M*^^^ and M*^^^ that appear in equation (|3^ ) 
are simply the parts of M (and in particular M'^) proportional to Cf' and CP . 

We will say that the E (B) component is detectable if the parameter as (c^b) can be measured with an expected 
uncertainty considerably less than one. Specifically, if has an expected uncertainty of i, then we can expect to 

measure the amplitude of Cp with a signal-to-noise ratio of q. So if, say, we are interested in knowing whether a 
particular experimental design will provide a 3-sigma detection of the E component, we simply compute the 2x2 
Fisher matrix F and determine whether ^/JF^ EE > 3. 



B. The JKW Approximation 



In the next section, we will present experimental requirements for detecting the E and B components, using the 
formalism described above. This process is somewhat laborious, since it involves manipulating 2N x 2N matrices, so 
simpler methods are clearly desirable. One very useful such approximation has been provided by Jaffe, Kamionkowski, 
& Wang (hereinafter JKW) [Q. Similar results may be found in In this section, we present a brief heuristic 
"derivation" of the JKW approximation. 

First, consider an experiment in which the N pixels cover the entire sky uniformly. Let us also suppose that the 
noise level is the same in each pixel of the Q and U maps. (In other words, cr^ is the same as a\^ of equation (|20| ) 
and is assumed to be the same across all pixels of both maps.) The analysis of this experiment is quite simple: we 
can estimate each coefficient a±2,im, and hence each Eim and Bim, independently by exploiting the orthonormality of 
the spherical harmonics over the whole sphere. Each coefficient will have noise of amplitude 

= (35) 
It is convenient to define the weight w of a set of Q and U maps to be'' 

N 

^^T. E ^A-- (36) 

i=l A={Q,U} 



The results of this section depend on the assumption of uniform noise, so w = 2N/a'^. Equation (35) can therefore 
be written 



-2 = -. (37) 
w 

If we wish to estimate the E-component power spectrum Cf , therefore, we have at our disposal a set of independent 
Gaussian random numbers (estimators of the true coefficients i?;m) with zero mean and variances 

{\Ei„,\')^CP + <tI (38) 

Since all the variables are independent, the likelihood has the usual Gaussian form: 

^ « (UiCF + al)] exp U J: if^) . (39) 

\t.m 1 \ Lm ^ * I 



Beware: this definition differs by a factor^jr from the weight as defined in JKW. While we're on the subject, note that the polarization 
power spectra in JKW and also in Ref. |22[ differ by a factor of two from those found elsewhere in the literature and in this paper. 
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If we assume as before that only the overall normalization of the power spectrum is unknown, we can determine its 
expected uncertainty just as in the previous section, by computing the Fisher matrix (which is diagonal for a full-sky 
experiment). Differentiating the above expression for £ twice, we find that the uncertainty in the parameter is 
given by 

Here SNR stands for "signal-to-noise ratio." An experiment with an SNR of, say, 3, is one in which we would expect 
to measure the power spectrum normalization with an accuracy of three sigma. Of course, an identical relation applies 
to as- 

This result has a very simple interpretation [ p5| . The quantity wCf /Sir is the square of the signal-to-noise ratio 
for a single mode at multipole I. Modes for which this quantity is much greater than unity contribute 1 to the sum, 
while modes for which it is much less than one contribute nothing. Therefore, what this formula says, roughly, is that 
the square of the signal-to-noise ratio of the power spectrum amplitude is equal to ^ the number of modes that are 
detected with high signal-to-noise. 

Now consider an experiment that only covers a fraction of the sky /sky- The number of independent modes that 
can be detected will of course be reduced by a factor /sky. On the other hand, the noise variance for each mode will 
be reduced by the same factor, since the total weight of the experiment is concentrated in a smaller area. We might 
guess, therefore, that the generalization of equation ( ^ ) to the case of partial sky coverage is 

(57Vi?)2 %y + ^ ^„ . (41) 

Equation ( ^l| ) is the JKW approximation (compare to equation (1) of JKW). 

A question arises as to the lower limit of the sum. JKW advocate starting the sum at Inun = 180°/L, where L 
is the survey size, on the grounds that modes with smaller I cannot be probed. It can be argued, though, that the 
sum should begin at the lowest possible / (namely 2). After all, the fact that modes with low / are not probed is 
already accounted for by the inclusion of the prefactor /sky The effective number of independent modes below some 
multipole Iq is approximately /sky^O' which is better approximated by /sky X]2°(2' + 1) than by /sky X);" (^^ + !)■ The 
choice made makes little difference to the final results (at most about 20%, and usually much less, for the results to 
be shown below), so for consistency we follow JKW's prescription. 

Of course, the JKW approximation does not account for the mixing of E and B modes due to incomplete sky 
coverage. We therefore expect it to overestimate the detectability of the subdominant B component, since some 
modes that are used to estimate B will be swamped by contamination from the larger E component. As we will 
see below, this is indeed the case. Perhaps more surprising is the fact that the JKW approximation sometimes 
underestimates the detectability of the E component. In the next subsection, we will consider an enhanced version of 
the JKW approximation that sheds some light on the reason for this. 

C. A toy model of E-B mixing 

In this section, we will consider an experiment that covers a square patch of sky that is small enough to permit the 



use of the flat-sky approximation. Suppose as usual that the sky polarization is a Gaussian random field p 
It can of course be written as a Fourier transform. 



U 



/ d2fcp(k)e^''-, (42) 



and each Fourier component can be split into an E and a B part: 

The assumption that the polarization is a realization of a homogeneous, isotropic, parity-invariant Gaussian random 
field means that E and B are Gaussian random variables with zero mean and covariances 

(^(k)^*(k')) = j^c'imk-k'), (44) 
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1 



(i?(k)i?*(k')) - ^C^(fc)(5(k-k'), (45) 
(£;(k)B*(k')> = 0. (46) 



The power spectra and depend only on the magnitude of k. The factor of (27r)^ is inserted for consistency 
with standard normahzation conventions: C^'^{k) « C; ' with I = k. 

Now, suppose that the polarization is measured over a square patch of sky of area L'^. We might^ choose to analyze 
such a data set by expanding the observed region in a Fourier series with coefficients 

a(q) = -^^^(i'xp(x)e-*'i-, (47) 

where q = {2Tr / L)(nx,ny) for integers rix and ny. 

These Fourier series coefficients are related to the true Fourier transform in the usual way, 



a«,)=/ 



d2fcp(k)W(k-q), (48) 



W{j) = ( "'""^'T ) ■ (49) 



where the window function is 

W(^\ ={- 

7^i/2 ; V lvL/2 

In other words, each mode a(q) probes a range of k values of width ~ L^^ around q. 

Suppose that we wish to estimate the E and B power spectra from these Fourier coefficients. We might proceed by 
decomposing each a(q) into an "E" and "B" piece: 

However, since a(q) contains contributions from a wide range of k's, not all of which are parallel to q, these will not 
really be purely E or B. In fact, the mean-square value of the supposedly E component is 

{\aE{cOf) = f (fk {C^{k) cos^ 2a + C'ik) sin^ 2a)W\k - q), (51) 

(^tt)^ J 

where cos a = k • q. A similar equation holds for the B component. 

When q is large compared to 1/L, of course, all values of k that contribute significantly to the integral are quite 
close in direction to q, so sin^ 2q! « and a£;(q) really does depend almost entirely on the E component. In other 
words, mixing of E and B is relatively unimportant on small scales. But for small q, a cannot be taken to be small. 
In fact, if we assume that the power spectra are approximately constant over the range where the window function is 
large, we can pull them out of the integral to find 

{\aE{ci)\') « {C'iqK + C'^iqK) , (52) 
where Cq and Sq are the averages of cos^ 2a and sin^ 2a weighted by W^^(k — q). Similarly, 

(laB(q)l') « {C'iqK + C^iqK) • (53) 
We can find these averages by numerical integration. Averaging over the direction of q, we get approximately 

1-^ = ^=^- 



In fact, we probably would not analyze the data in quite such a naive way; we would at least taper the edges to reduce ringing in the 
Fourier modes. But we're trying to keep things simple. 
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FIG. 3: E and B Power Spectra. The solid curves are the ACDM models with no reionization described in the text. From top 
to bottom, T/S takes the values 0.1, 0.05, 0.01. The dashed curve is a reionized model with r = 0.3, T/S = 0.1, and n tilted 
to 1.15. 



(If we are a bit more sophisticated and taper the edges of the data before Fourier transforming, then the window 
function W doesn't ring so much, and can be reduced by ^ 10-20%. As we will see in the next section, though, 
this approximation gives surprisingly good results as is.) 

Now suppose that ^ . Then for each q with a reasonably large value of we will have two independent^ 
modes that are dominated by E, rather than the expected one: according to equation (|5^), even the nominal B mode 
aB(q) is mostly E! Since, as we have seen in the previous section, the signal-to-noise ratio is determined by counting 
the number of modes with high signal-to- noise, this enhances the significance with which ue can be measured. 

In fact, we can modify the JKW approximation to take this into account. Each "nominal E mode" a£;(q) provides 
a measurement of the E-mode with mean-square amplitude Cf" and mean-square noise Sirfsky/w + CP sf. (Here 
|q| = I. We are imagining an attempt to measure the E-component, so we treat the B-component part of the signal 



as if it were noise.) We can therefore replace ^irfsVy/wCf" in equation (41) with 



And we should also include a term that accounts for the fact that aB(q) can be used to measure the E-mode amplitude: 

. <^-MZl^?^' ^ ,56, 



The final result is 



iSNH)i = if + 1, + . ,57) 



Although this looks quite messy, the interpretation is fairly simple, k'^i^ and k^;^ are the squared signal-to-noise 
ratios with which the E-componcnt amplitude can be detected in each mode (counting the B-componcnt contribution 
as noise). Modes with high signal-to-noise contribute one to the sum, while modes with low signal-to-noise (or low 
E-signal-to-B-signal, since B-signal is being counted as noise) contribute zero. In the no-mixing limit, sf — > 0, the K21 
term doesn't contribute, and the ku term agrees with equation (Ell). 



It can be shown that correlations between the different modes are relatively weak. This is not terribly precise, but after all this section 
is called "a toy model" ! 
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FIG. 4: E and B component detectability for different tensor-scalax ratios. In this figure and the next, the vertical axis is the 
observing time in detector- years, at a nominal sensitivity of 100 /xKs^^^, required for a 3-sigma measurement of the power 
spectrum amplitude. The beam size is (Tbcam ~ 0.3° (FWHM=0.71°). The points are the results of the Fisher matrix analysis. 
The three different symbols are for maps with pixel sizes of 0.18°, 0.3°, and 0.5°. The solid line is the result found with the 
JKW approximation. In the E-component case, the result of the enhanced JKW approximation is shown as a dashed line. 



Incidentally, we now face the same question as in the original JKW approximation regarding the lower limit of the 
sum. As before, it makes relatively little difference whether we start the sum at 2 or at 180° /L. In the results shown 
below, we have started the sum at 2. 

We can try to use this approximation to find the B-component detectability, but it turns out to give terrible results. 
This is not surprising: because we can always measure the E component more accurately than the B component, 
modeling it simply as an unknown source of noise is a poor approximation. 
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FIG. 5: E and B component detectability for different beam sizes. The results shown here all have T/S = 0.01. The symbols 
have the same meanings as in Figure ^ except for the pixel sizes: for a beam size of 0.1°, the different symbols represent pixel 
sizes of 0.06°, 0.1°, 0.15°, while for the 0.5° beam the symbols represent 0.3°, 0.5°, 0.8° pixels. 



IV. RESULTS 



Consider an experiment in which a square patch of sky of length L is observed. We will suppose that the patch is 
pixelized into an A^sido x .^side square array and that both Q and U are measured in each pixel, so that the dimension 
of our data vector d is 2^^^,^^^^,. Suppose furthermore that the noise level a is the same for all of these measurements. 
The weight of the experiment is then 

9 

w= (58) 

The weight is of course proportional to the observing time: w = A^diobs^o /s^, where is the number of detectors 
and fobs is the observing time. Tq = 2.728 K is the current CMB temperature. Its presence is simply to convert units: 
we choose to measure the sensitivity s in temperature units but the power spectra in dimensionless AT/T units. 
Instead of quoting weights, therefore, we can simply quote observing times in, say, detector-years at some nominal 
sensitivity level. In the results below, we will use s = 100 /iK s^^^ as our nominal sensitivity, so our observing times 
in detector-years will simply be 

f tohs \ /100AiKsi/2\^ w 
Detector-years - j [-^^ j = 2:35^- ^''^ 

We consider a "concordance" ACDM cosmological model with parameters chosen for reasonable agreement with 
the bulk of the available data: h = 0.70, il^ = 0.041, flj^ = 0.7, fltot = 1, n = 1. In most of our results, we ignore the 
effects of reionization by setting the optical depth r to last scattering equal to zero. We allow the tensor-to-scalar 
ratio T/S, defined as the ratio of the tensor and scalar temperature power spectra at / = 2, to vary. E and B power 
spectra for these models are shown in Figure ^. 

Once all of the above parameters have been specified, we can compute the E and B component power spectra 
with CMB FAST, COBE- normalize [31, 0, and smooth with a Gaussian beam (Tbcam- We then use the formalism 
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FIG. 6: Comparison of JKW and Fisher matrix detectabilities. The quantity on the vertical axis is the ratio of the required 
B-mode detection time calculated using the JKW approximation to that calculated from the Fisher matrix. On the horizontal 
axis is the survey size measured in units of the B-mode coherence length defined in the text. The solid lines are the abeam = 0.3° 
models from Figure Q T/S decreases from top to bottom. The dotted and dashed lines are the 0.1°- and 0.5°-beam models 
plotted in Figure ^. The kinks in the curves are caused by the fact that different pixel sizes were used in the maps. 



of Section pi A| to calculate the signal-to- noise ratios for the amplitude of the E and B components. Since we are 
interested in determining the required experimental parameters for a strong detection of the two amplitudes, we fix 
the desired signal-to-noise at 3 and solve numerically for the required weights. The results are shown in Figures ^ 
and 1^. For comparison, we also show the result of the JKW approximation and, for the case of the E component, the 
result of the "enhanced" JKW approximation of equation ( |57| ) . Although most of our results were calculated with 
fboam — 0.3°, we calculated the T/S = 0.01 power spectra for (ibcam = 0.1° and 0.5° as well to illustrate the effect of 
beam size on our results. 

In the Fisher matrix analysis, we analyze maps with several different pixel sizes, as indicated in the figure captions. 
For the 0.3° beam, for example, the largest pixel size is 0.5°, which is somewhat larger than one would ideally like to 
use. We use this large pixel size so that we can investigate data sets with large sky coverage without having to invert 
inconveniently large matrices. As shown in the figures, there is overlap in map size between different pixel sizes, which 
gives a crude idea of the amount of information being lost when large pixel sizes are used. In general, the bulk of the 
information is found on scales larger than the pixel scale; for maps with a reasonable number of pixels, the fact that 
we are undersampling the beam does not appear to affect the results very much. 

Note that the JKW approximation underestimates the difficulty of detecting the B component as expected. For 
small survey sizes, though, it overestimates the difficulty of measuring the E component amplitude. The enhanced 
JKW approximation provides a better fit in this regime. 

The detectability of the E component is quite flat as a function of survey size even for fairly small maps, but that 
of the B component is not. For the case of a 0.3° beam, the optimal survey size for detecting the B component 
is 22°, as compared to 17° found via the JKW approximation. For CTboam — 0.5°, the optimum is 26°, while the 
JKW approximation would give 19°. For the (ibcam = 0.1° case, the optimum survey size was too large to find 
with the Fisher-matrix formalism. The JKW optimum for this case occurs at L = 15°, with a detection time of 30 
detector-years; it is reasonable to suppose that the true optimum is at even larger scales. 

While it may not be necessary for an experiment to achieve the optimum survey size, one would certainly like to stay 
well to the right of the "knee" in Figures^ and ^. For the case of a 0.3° beam, the required detection time is relatively 
flat for L > 12° but rises sharply at smaller L. This suggests that a map of at least 40 x 40 (Xbeam-sized pixels should 
be made in order to measure the B-component amplitude in such an experiment. The experimental weight required 
for a 3-sigma B-component detection in such an experiment would be such that the B-component-signal-to-noise per 
pixel is approximately unity. 

In order to determine whether reionization made a significant difference to our results, we examined a model in 
which the optical depth to last scattering was r = 0.3. In order to keep the temperature power spectrum more or 
less the same, we tilted the spectral index n from 1 to 1.15. As shown in Figure |^, this makes a large difference to 
the polarization power spectra, but most of the difference is on larger scales than those to which our degree-scale 
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experiments are most sensitive. The required detection times for this model were very similar to those in the no- 
reionization case: for the E-component, the difference was less than 4%, while for the B-component it ranged from 
15% down to 5% over the range of survey sizes shown in Figure ^. 

V. DISCUSSION 

As we have seen in the previous section, the Fisher-matrix formalism gives a slightly better detectability for 
the E-component power spectrum amplitude than is predicted by the no-mixing JKW approximation, at least in 
the case of small survey sizes. The reason for this is that some modes that "should" probe the B component are 
actually dominated by the E component, increasing the number of independent modes that can be used to estimate 



the E-component amplitude. The enhanced JKW approximation (57) does surprisingly well at characterizing the 
E-component detectability, confirming that this "extra mode" explanation is indeed correct. 

The main practical consequence of the enhanced E-component detectability is that experiment designers need not 
be terribly concerned about ensuring large sky coverage: the detection requirements are fairly flat as a function of 
survey size. For the B component, on the other hand, survey size is quite important. The required observation time 
rises sharply as the survey size decreases. For a 0.3° beam, the "knee" in detectability occurs at a survey size of about 
12°. 

As expected, the neglect of mixing in the JKW approximation causes it to underestimate the difhculty of measuring 
the B component. We might expect this underestimation to depend primarily on the survey size, measured relative 
to some sort of coherence length of the signal being looked for. In Figure ^, we illustrate this by plotting the ratio of 
the detection time calculated using the JKW approximation to the Fisher-matrix calculation. On the horizontal axis 
is plotted the survey size, measured in units of the B-component coherence length. The coherence length is defined 
by ^coh — "■/^coh, where the coherence multipole is 

the weighted average multipole, with weights given by l{2l + 1)CP . (Here is the beam-smoothed power spectrum 
as usual.) All of the models discussed in the last section are plotted here. In general, the JKW approximation 
becomes good when the survey size is many coherence lengths, but for L ^ 20 coherence lengths, it underestimates 
the detection difficulty by a factor of ~ 2.5. 

It is important to note that the Fisher-matrix formalism used in this paper gives the best possible error bars that 
can be achieved from a given experiment. Even if, for example, we expand the data set in some set of normal modes 



that minimize E-B mixing |2^, 27, we cannot do any better than the brute- force likelihood analysis of the entire 
data set on which the Fisher-matrix formalism is based. This is not to say, of course, that such methods are not 
useful. They may reduce computation time, provide insight into the nature of the E-B decomposition, and allow 
filtered real-space maps of the E and B components to be made, for example. 

Although the Fisher-matrix results represent the idealized minimum possible error bars, they are likely to be quite 
close to the errors achievable in real experiments.^ After all, lossless or nearly lossless methods such as those that 
have been applied to recent temperature anisotropy measurements [Q, ^ can be easily adapted to the polarization 
case. The other idealization that has been made in this analysis is that only the amplitudes, and not the shapes, 
of the power spectra are unknown. This will of course not be true in a real experiment (although, as long as CMB 
temperature anisotropy experiments continue to return results consistent with standard models, strong constraints 
on the shapes will be available); however, for the first few experiments to detect the E and B modes, a band-power 
estimate of the power spectrum in a few bands will probably be sufficient and may be expected to yield error bars 
similar in size to those calculated by the Fisher-matrix formalism. 

E-B mixing is significant only on scales comparable to the size of the survey, and therefore affects a relatively small 
number of modes p7| , p8| . It might therefore be surprising that mixing has as large an effect on the B-component 
detectability as it does. The main point to remember in this regard is that the modes with wavelengths of order the 
survey size are always the ones that are detected with the highest signal-to-noise. (After all, the beam-smoothed CP 
is invariably a sharply decreasing function of I, whereas the noise has a flat power spectrum.) The loss of these modes 
to mixing therefore has a disproportionately large effect. 



Unless foreground contamination, which is not treated in the present work, proves to be a serious problem. 
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It should be noted that the criterion adopted for "detectabiUty" of a component in this paper is that the amphtude 
of that component's power spectrum can be measured with small fractional uncertainty. It is possible for a component 
to be "detected" (i.e., for the null hypothesis that the component is absent to be ruled out) at high significance even 
if this criterion is not met. For instance, if a single mode is detected with high signal-to-noise, and if that mode is 
known to be a pure B-mode, with no E-component contamination, then the B-component will have been detected with 
high significance. However, based on a single mode, the amplitude of the power spectrum cannot be determined with 
fractional uncertainty less than 0(1), so such an experiment would not meet the detectability criterion considered in 
this paper. 

This scenario could easily occur in an experiment designed to measure the Stokes parameters in a thin ring |2^ , 
p5| , p6| . Because an accurate characterization of the power spectrum amplitudes will be extremely important in 
interpreting polarization results, and because a detection of the polarization in many different modes is much more 
robust than a detection in only a few, we have chosen to adopt the stronger detectability criterion of this paper. 

The formalism described in this paper has numerous applications. Although we have considered only square maps, 
it can of course be used to explore the effects of survey geometry on detectability of the two components. It can also 
easily be adapted to examine the degree to which inhomogeneous noise alters detectability. One might suppose, for 
instance, that measuring the edges of the survey with high precision would help in separating the two components, 
since the modes that are "ambiguous" with respect to the E-B split tend to be supported most strongly near the 
boundary p^ . Finally, the formalism described herein may prove useful in studies of weak gravitational lensing {e.g., 
p5| , |3^): the shear induced by lensing is a spin-2 field, and the mathematics is therefore quite closely analogous to 
the case of CMB polarization. 
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